Recognition of Printed Sinhala Characters Using Linear Symmetry
نویسنده
چکیده
Sinhala characters used in the Sinhala script by over 70% of the 18 million population in Sri Lanka, have been descended from the ancient Brahmi script. The Sinhala alphabet consists of vowels and consonants and the consonants are modified using modifier symbols to give the required vocal sounds. In the process of developing an OCR for the Sinhala script, characters are initially recognised through a multi-level filtering process using the Linear Symmetry [LS] feature [1]. The recognised character is then segmented to identify the associated modifier symbol/s. Since the use of LS recognises characters prior to segmentation, the most difficult task of separating touching characters is easily solved. A method to determine the skew angle of the script is also presented. Experiments conducted so far for widely used fonts of different sizes yield encouraging results.
منابع مشابه
A segmentation-free approach to recognise printed Sinhala script using linear symmetry
In this paper, a novel approach for printed character recognition using linear symmetry is proposed. When the conventional character recognition methods such as the arti1cial neural network based techniques are used to recognise Brahmi Sinhala script, segmentation of modi1ed characters into modi1er symbols and basic characters is a necessity but a complex issue. The large size of the character ...
متن کاملA Segmentation-free Approach to Recognise Printed Sinhala Script
Majority of character recognition algorithms such as the use of ANNs needs segmentation of the script prior to recognition. Contrast to Western scripts, Brahmi descended South Asian scripts such as Sinhala consist of modifier symbols, which make the segmentation a difficult task that needs to be addressed as a separate issue. Further, the change of shape of the basic character (by violating mod...
متن کاملLexicon and hidden Markov model-based optimisation of the recognised Sinhala script
The Brahmi descended Sinhala script is used by 75% of the 18 million population in Sri Lanka. To the best of our knowledge, none of the Brahmi descended scripts used by hundreds of millions of people in South Asia, possess commercial OCR products. In the process of implementation of an OCR system for the printed Sinhala script which is easily adoptable to similar scripts [Premaratne, L., Assabi...
متن کاملA Neural Network Based Character Recognition System for Sinhala Script
Much effort has been extended in making a computer recognise both typed and handwritten characters automatically. Until quite recently, the focus of this endeavour has been on characters of English Language. As for Asian languages such as Sinhala and Tamil, little or no attention has been given. Methods currently widely used for character recognition for these languages are mainly those which i...
متن کاملOff-Line Sinhala Handwriting Recognition Using Hidden Markov Models
This paper describes a method to recognize off-line handwritten Sinhala characters, the language used by the majority of Sri Lanka. The classification approach is based on discrete hidden Markov models. A subset of the Sinhala alphabet was chosen for the study. The unknown characters are first pre-classified into one of three character groups, based on the structural properties of the text line...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001